Change-Oriented Summarization of Temporal Scholarly Document Collections by Semantic Evolution Analysis
نویسندگان
چکیده
The number of scholarly publications has dramatically increased over the last decades. For anyone new to a particular science domain it is not easy understand major trends and significant changes that undergone time. Temporal summarization related approaches should be then useful make sense temporal collections. In this paper we demonstrate an approach analyze dataset research papers by providing high level overview important occurred time in dataset. novelty our lies adaptation methods used for semantic term evolution analysis. However, just single words independently, but estimate common drifts shared groups semantically converging words. As example study ACL Anthology Reference Corpus spans from 1974 2015 contains 22,878 articles.
منابع مشابه
Fuzzy Clustering for Topic Analysis and Summarization of Document Collections
Large document collections, such as those delivered by Internet search engines, are difficult and time-consuming for users to read and analyse. The detection of common and distinctive topics within a document set, together with the generation of multi-document summaries, can greatly ease the burden of information management. We show how this can be achieved with a clustering algorithm based on ...
متن کاملChange Summarization in Web Collections
World Wide Web is not only enormous but also dynamic information space. Every day large quantity of new information is published on web pages. Many times people want to know what are the major changes in their area of interest over a given time period. This paper addresses the problem of summarizing changes in web collections devoted to a common topic. We have created a system called ChangeSumm...
متن کاملSemantic Wordification of Document Collections
Word clouds have become one of the most widely accepted visual resources for document analysis and visualization, motivating the development of several methods for building layouts of keywords extracted from textual data. Existing methods are effective to demonstrate content, but are not capable of preserving semantic relationships among keywords while still linking the word cloud to the underl...
متن کاملCreating synthetic temporal document collections
In research in temporal document databases, large temporal document collections are necessary in order to be able to compare and evaluate new strategies and algorithms. Large temporal document collections are not easily available, and an alternative is to create synthetic document collections. In this paper we will describe how to generate synthetic temporal document collections, how this is re...
متن کاملImproving self-organization of document collections by semantic mapping
In text management tasks, the dimensionality reduction becomes necessary to computation and interpretability of the results generated by machine learning algorithms. This paper describes a feature extraction method called semantic mapping. Semantic mapping, sparse random mapping and PCA are applied to self-organization of document collections using self-organizing map (SOM). The behaviors of th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Access
سال: 2022
ISSN: ['2169-3536']
DOI: https://doi.org/10.1109/access.2021.3135051